JuliaSyntax parser-based REPL completions overhaul #57767

xal-0 · 2025-03-13T23:38:17Z

Overview

As we add REPL features, bugs related to the ad-hoc parsing done by
REPLCompletions.completions have crept in. This pull request replaces most of
the manual parsing (regex, find_start_brace) with a new approach that parses
the entire input buffer once, before and after the cursor, using JuliaSyntax.
We then query the parsed syntax tree to determine the kind of completion
to be done.

Changes

New, JuliaSyntax-based completions mechanism.
The complete_line interface now has the option of replacing arbitrary
regions of text in the input buffer by returning a Region (Pair{Int, Int}
for consistency with the convention in LineEdit, and pos being a 0-based
byte offset). This is used to improve the handling of auto-inserted closing
delimiters.. Leaving this unchanged for now.
Fixes parsing-related bugs:
Fixes some bugs that exist on 28d3bd5 that were found by fuzzing:
- x \" + TAB throws a FieldError exception
- String completion would sometimes delete the entire input buffer.
- Completions should not happen inside comments.
The duplicate code for path completion in strings, Cmd-strings, and the
shell has been removed, causing paths to complete the same way for all three.
Now, ~ is expanded in two situations:
- If foo exists, or if foo does not exist but there are no
  possible completions:
```
"~/foo/b|"     =TAB=>   "~/foo/bar|"
"~/foo/bar|"   =TAB=>   "/home/user/foo/bar|"
   OR
"~/foo/bar"|   =TAB=>   "/home/user/foo/bar"|
```
- If the current path ends with a / and you hit TAB again:
```
"~/foo/|"      =TAB=>   "/home/user/foo/|"
   OR
"~/foo/"|      =TAB=>   "/home/user/foo/"|
```

Future work

Method completions could be changed to look for methods with exactly the given
number of arguments if the closing ) is present, and search for signatures
with the right prefix otherwise.
- It would be nice to be able to search by type as well as value (perhaps
  by putting ::T in place of arguments).
Other REPL features could benefit from JuliaSyntax, so it might be worth
sharing the parse tree between completions and other features:
- Emacs-style sexpr navigation: C-M-f/C-M-b/C-M-u, etc.
- Improved auto-indent.
It would be nice if hints worked even when the cursor is between text.
CursorNode is a slightly tweaked copy of SyntaxNode from JuliaSyntax that
tracks the parent node but includes all trivia. It is used with seek_pos,
which navigates to the innermost node at a given position so we can examine
nearby nodes and the parent. This could probably duplicate less code from
JuliaSyntax.

Adds another permitted return type for complete_line, where the second element of the tuple is a Region (a Pair{Int, Int}) describing the region of text to be replaced. This is useful for making completions work consistently when the closing delimiter may or may not be present: the cursor can be made to "jump" out of the delimiters regardless of whether it is there already. "exam| =TAB=> "example.jl"| "exam|" =TAB=> "example.jl"|

This commit replaces the heuristic parsing done by REPLCompletions.completions with a new approach that parses the entire input buffer once with JuliaSyntax. In addition to fixing bugs, the more precise parsing should allow for new features in the future. Some features now work in more situations "for free", like dictionary key completion (the expression evaluated to find the keys is now more precise) and method suggestions (arguments beyond the cursor can be used to narrow the list). The tests have been updated to reflect slightly differing behaviour for string and Cmd-string completion: the new code returns a character range encompassing the entire string when completing paths (not observable by the user), and the behaviour of '~'-expansion has be tweaked to be consistent across all places where paths can be completed. Some escaping issues have also been fixed. Fixes: JuliaLang#55420, JuliaLang#55518, JuliaLang#55520, JuliaLang#55842, JuliaLang#56389, JuliaLang#57611

IanButterworth · 2025-03-14T02:43:30Z

This sounds great! I think we should backport it to at least 1.12 given it fixes so many issues.

giordano

@xal-0 first of all, speaking as the author of most of the issues linked to this PR, thank you for tackling this!

Also, can you please add a test for #57772 as well? I verified this works already (which is amazing!)

stdlib/REPL/test/replcompletions.jl

StefanKarpinski · 2025-03-14T15:25:17Z

Bravo. Truly excellent quality-of-life PR 👏🏻

giordano · 2025-03-14T17:40:12Z

Bravo. Truly excellent quality-of-life PR 👏🏻

Best part is that this PR has a negative diff, depsite the fact it added many tests.

stdlib/REPL/test/replcompletions.jl

…JuliaLang#57307, JuliaLang#57624 Also fix test failing for silly reason

xal-0 · 2025-03-26T00:04:49Z

I am going to disable the shell completion tests until the shell mode can parse Windows paths...

shell> cd C:\Users
ERROR: IOError: cd("C:Users"): no such file or directory (ENOENT)
Stacktrace:
 [1] uv_error
   @ .\libuv.jl:106 [inlined]
 [2] cd(dir::String)
   @ Base.Filesystem .\file.jl:91
 [3] repl_cmd(cmd::Cmd, out::Base.TTY)
   @ Base .\client.jl:64
 [4] top-level scope
   @ none:1

Also cleans up do_cmd_escape, so that it can use different escaping syntax from the shell mode (which we may want to make similar to cmd.exe on Windows).

vtjnash · 2025-03-28T14:53:28Z

until the shell mode can parse Windows paths

The shell mode has no problem with Windows paths which is why the REPL has tests for it. It just sounds like you lost a call to Base.shell_escape on that code path.

stdlib/REPL/src/REPLCompletions.jl

vtjnash · 2025-03-28T15:48:26Z

stdlib/REPL/src/REPLCompletions.jl

 end

-function shell_completions(string, pos, hint::Bool=false)
+function shell_completions(string, pos, hint::Bool=false; cmd_escape::Bool=false)


Since we always use the shell parser, why would we ever want this set true? I believe in the past, the code decided between whether it was completing one argument or several arguments, with escaping thus either wanting to include or exclude spaces. And thus choosing to fail if the input contained space-separate arguments rather than handling them, so I don't know why that would ever be useful.

vtjnash · 2025-03-28T16:01:35Z

stdlib/REPL/src/REPLCompletions.jl

-    # escape_raw_string with delim='`' and ignoring the rule for the ending \
-    return replace(s, r"(\\+)`" => s"\1\\`")
+function do_cmd_escape(s)
+    return Base.shell_escape_posixly(Base.escape_raw_string(s, '`'))


This seems like nonsense: I can't think of any case where we'd expect it'd be useful for the the REPL to pass the string unmodified to a posix shell for use as an argument in a julia script, which is what this transform order implements.

The previous transform took a string text that was already in a valid posix-shell form (e.g. for raw input to shell_parse for shell> text mode) and corrects it for the julia parser (e.g. for :(`text`)). Usually our shell_escape_posixly attempts to add quotes to avoid this edge case complexity, but if we have chosen to preserve the user-typed syntax then this is needed

julia> a = `\\ b\\\\` `'\' 'b\'`

I intended cmd_escape to make it possible to complete shell commands inside Cmd-strings, but mixed up the order (Base.escape_raw_string(Base.shell_escape_posixly(s), '`') does what I want I think):

julia> println(R.do_cmd_escape("file ` 1")) 'file \` 1' julia> @macroexpand `'file \` 1'` :(Base.cmd_gen((("file ` 1",),)))

Completion with shell_escape_posixly is still unwieldy because you get partial completions that insert only the opening quote, and can't complete any further.

stdlib/REPL/src/REPLCompletions.jl

Co-authored-by: Jameson Nash <[email protected]>

IanButterworth · 2025-04-14T14:06:11Z

What's the status of this? I'm just wondering whether it's worth getting it in and continuing work in PRs?

xal-0 · 2025-04-14T21:47:58Z

With the REPL completions working well enough on Windows again, this is ready for review again. 👍

KristofferC · 2025-04-22T14:05:13Z

🎤 ⬇️

IanButterworth · 2025-04-22T14:43:02Z

As someone who tried and failed to fix the issues this closes by patching the old approach, thank you for fixing this properly!

# Overview As we add REPL features, bugs related to the ad-hoc parsing done by `REPLCompletions.completions` have crept in. This pull request replaces most of the manual parsing (regex, `find_start_brace`) with a new approach that parses the entire input buffer once, before and after the cursor, using JuliaSyntax. We then query the parsed syntax tree to determine the kind of completion to be done. # Changes - New, JuliaSyntax-based completions mechanism. - The `complete_line` interface now has the option of replacing arbitrary regions of text in the input buffer by returning a `Region` (`Pair{Int, Int}` for consistency with the convention in LineEdit, and `pos` being a 0-based byte offset). - Fixes parsing-related bugs: - fix #55420 - fix #55429 - fix #55518 - fix #55520 - fix #55842 - fix #56389 - fix #57307 - fix #57611 - fix #57624 - fix #58099 - Fixes some bugs that exist on 28d3bd5 that were found by fuzzing: - `x \"` + `TAB` throws a `FieldError` exception - String completion would sometimes delete the entire input buffer. - Completions should not happen inside comments. - The duplicate code for path completion in strings, `Cmd`-strings, and the shell has been removed, causing paths to complete the same way for all three. Now, `~` is expanded in two situations: - If `foo` exists, or if `foo` does not exist but there are no possible completions: ``` "~/foo/b|" =TAB=> "~/foo/bar|" "~/foo/bar|" =TAB=> "/home/user/foo/bar|" OR "~/foo/bar"| =TAB=> "/home/user/foo/bar"| ``` - If the current path ends with a `/` and you hit TAB again: ``` "~/foo/|" =TAB=> "/home/user/foo/|" OR "~/foo/"| =TAB=> "/home/user/foo/"| ``` # Future work - Method completions could be changed to look for methods with exactly the given number of arguments if the closing `)` is present, and search for signatures with the right prefix otherwise. - It would be nice to be able to search by type as well as value (perhaps by putting `::T` in place of arguments). - Other REPL features could benefit from JuliaSyntax, so it might be worth sharing the parse tree between completions and other features: - Emacs-style sexpr navigation: `C-M-f`/`C-M-b`/`C-M-u`, etc. - Improved auto-indent. - It would be nice if hints worked even when the cursor is between text. - `CursorNode` is a slightly tweaked copy of `SyntaxNode` from JuliaSyntax that tracks the parent node but includes all trivia. It is used with `seek_pos`, which navigates to the innermost node at a given position so we can examine nearby nodes and the parent. This could probably duplicate less code from JuliaSyntax. (cherry picked from commit ff0a931)

# Overview As we add REPL features, bugs related to the ad-hoc parsing done by `REPLCompletions.completions` have crept in. This pull request replaces most of the manual parsing (regex, `find_start_brace`) with a new approach that parses the entire input buffer once, before and after the cursor, using JuliaSyntax. We then query the parsed syntax tree to determine the kind of completion to be done. # Changes - New, JuliaSyntax-based completions mechanism. - The `complete_line` interface now has the option of replacing arbitrary regions of text in the input buffer by returning a `Region` (`Pair{Int, Int}` for consistency with the convention in LineEdit, and `pos` being a 0-based byte offset). - Fixes parsing-related bugs: - fix JuliaLang#55420 - fix JuliaLang#55429 - fix JuliaLang#55518 - fix JuliaLang#55520 - fix JuliaLang#55842 - fix JuliaLang#56389 - fix JuliaLang#57307 - fix JuliaLang#57611 - fix JuliaLang#57624 - fix JuliaLang#58099 - Fixes some bugs that exist on 28d3bd5 that were found by fuzzing: - `x \"` + `TAB` throws a `FieldError` exception - String completion would sometimes delete the entire input buffer. - Completions should not happen inside comments. - The duplicate code for path completion in strings, `Cmd`-strings, and the shell has been removed, causing paths to complete the same way for all three. Now, `~` is expanded in two situations: - If `foo` exists, or if `foo` does not exist but there are no possible completions: ``` "~/foo/b|" =TAB=> "~/foo/bar|" "~/foo/bar|" =TAB=> "/home/user/foo/bar|" OR "~/foo/bar"| =TAB=> "/home/user/foo/bar"| ``` - If the current path ends with a `/` and you hit TAB again: ``` "~/foo/|" =TAB=> "/home/user/foo/|" OR "~/foo/"| =TAB=> "/home/user/foo/"| ``` # Future work - Method completions could be changed to look for methods with exactly the given number of arguments if the closing `)` is present, and search for signatures with the right prefix otherwise. - It would be nice to be able to search by type as well as value (perhaps by putting `::T` in place of arguments). - Other REPL features could benefit from JuliaSyntax, so it might be worth sharing the parse tree between completions and other features: - Emacs-style sexpr navigation: `C-M-f`/`C-M-b`/`C-M-u`, etc. - Improved auto-indent. - It would be nice if hints worked even when the cursor is between text. - `CursorNode` is a slightly tweaked copy of `SyntaxNode` from JuliaSyntax that tracks the parent node but includes all trivia. It is used with `seek_pos`, which navigates to the innermost node at a given position so we can examine nearby nodes and the parent. This could probably duplicate less code from JuliaSyntax.

xal-0 added bugfix This change fixes an existing bug completions Tab and autocompletion in the repl labels Mar 13, 2025

xal-0 requested a review from vtjnash March 13, 2025 23:38

xal-0 force-pushed the juliasyntax-repl branch from 385cc7f to b62b012 Compare March 13, 2025 23:40

xal-0 mentioned this pull request Mar 13, 2025

REPL completions: Enter import mode only when cursor beyond "import" #57473

Closed

xal-0 added 2 commits March 13, 2025 16:49

xal-0 force-pushed the juliasyntax-repl branch from b62b012 to c539ec4 Compare March 13, 2025 23:49

xal-0 added the don't squash Don't squash merge label Mar 14, 2025

IanButterworth added the backport 1.12 Change should be backported to release-1.12 label Mar 14, 2025

giordano mentioned this pull request Mar 14, 2025

!ismi^tab does not tab complete inside function call #57772

Closed

giordano reviewed Mar 14, 2025

View reviewed changes

stdlib/REPL/test/replcompletions.jl Outdated Show resolved Hide resolved

IanButterworth added the REPL Julia's REPL (Read Eval Print Loop) label Mar 14, 2025

xal-0 removed the don't squash Don't squash merge label Mar 14, 2025

giordano reviewed Mar 14, 2025

View reviewed changes

stdlib/REPL/test/replcompletions.jl Outdated Show resolved Hide resolved

xal-0 added 2 commits March 14, 2025 11:54

REPL: Add tests for JuliaLang#55420, JuliaLang#55429, JuliaLang#55842, …

304aefa

…JuliaLang#57307, JuliaLang#57624 Also fix test failing for silly reason

Add test for JuliaLang#57772

498aec3

xal-0 added 3 commits March 25, 2025 17:32

REPL: Escape path completions after joining dir and path

9980020

Also cleans up do_cmd_escape, so that it can use different escaping syntax from the shell mode (which we may want to make similar to cmd.exe on Windows).

Merge remote-tracking branch 'upstream/master' into juliasyntax-repl

69db0a5

REPL: new backslash escape hack for Windows Pkg completions

d48fd5e

xal-0 force-pushed the juliasyntax-repl branch from 60dec58 to d48fd5e Compare March 26, 2025 19:48

vtjnash reviewed Mar 28, 2025

View reviewed changes

stdlib/REPL/src/REPLCompletions.jl Show resolved Hide resolved

vtjnash reviewed Mar 28, 2025

View reviewed changes

stdlib/REPL/src/REPLCompletions.jl Outdated Show resolved Hide resolved

KristofferC mentioned this pull request Mar 31, 2025

Backports release 1.12 #57955

Merged

36 tasks

xal-0 and others added 3 commits March 31, 2025 14:54

Update stdlib/REPL/src/REPLCompletions.jl

8cd20d4

Co-authored-by: Jameson Nash <[email protected]>

Use '/' as directory separator for shell completions

8dee0ff

Merge remote-tracking branch 'upstream/master' into juliasyntax-repl

e2eef34

KristofferC mentioned this pull request Apr 4, 2025

Backports for 1.12.0-beta2 #58009

Merged

51 tasks

xal-0 added 3 commits April 7, 2025 08:00

Restore previous Pkg.jl path completion

14dde09

Merge remote-tracking branch 'upstream/master' into juliasyntax-repl

8ee2978

Allow REPL completions inside string interpolation

5839241

oscardssmith mentioned this pull request Apr 14, 2025

autocompletion not working after <: #58099

Closed

vtjnash merged commit ff0a931 into JuliaLang:master Apr 22, 2025
8 checks passed

j-fu mentioned this pull request Apr 26, 2025

Julia 1.12.0-beta2 support JunoLab/FuzzyCompletions.jl#22

Open

KristofferC mentioned this pull request Apr 29, 2025

Backports for 1.12.0-beta3 #58270

Merged

53 tasks

KristofferC removed the backport 1.12 Change should be backported to release-1.12 label May 5, 2025

aviatesk mentioned this pull request Jun 2, 2025

Implement signature help aviatesk/JETLS.jl#34

Merged

Uh oh!

JuliaSyntax parser-based REPL completions overhaul #57767

JuliaSyntax parser-based REPL completions overhaul #57767

Uh oh!

Conversation

xal-0 commented Mar 13, 2025 • edited by giordano Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Changes

Future work

Uh oh!

IanButterworth commented Mar 14, 2025

Uh oh!

giordano left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

StefanKarpinski commented Mar 14, 2025

Uh oh!

giordano commented Mar 14, 2025

Uh oh!

Uh oh!

xal-0 commented Mar 26, 2025

Uh oh!

vtjnash commented Mar 28, 2025

Uh oh!

Uh oh!

vtjnash Mar 28, 2025

Choose a reason for hiding this comment

Uh oh!

vtjnash Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xal-0 Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

xal-0 Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

IanButterworth commented Apr 14, 2025

Uh oh!

xal-0 commented Apr 14, 2025

Uh oh!

Uh oh!

KristofferC commented Apr 22, 2025

Uh oh!

IanButterworth commented Apr 22, 2025

Uh oh!

Uh oh!

xal-0 commented Mar 13, 2025 •

edited by giordano

Loading

vtjnash Mar 28, 2025 •

edited

Loading